A rate-distortion optimization model for SVC inter-layer encoding and bitstream extraction

نویسندگان

  • Wen-Hsiao Peng
  • John Kar-Kin Zao
  • Hsueh-Ting Huang
  • Tse-Wei Wang
  • Lin-Shung Huang
چکیده

The emerging Scalable Video Coding (SVC) Extension of H.264/AVC standard enables partial extraction and decoding of video bitstreams, and thus allows viewing devices to adapt their video reception and playback according to devices capability and network performance. This desirable feature, however, comes with a caveat: the parts of a SVC bitstream needed for good quality playback at different devices may differ significantly depending on the visual characteristics of video contents, the quantization and dependency settings at the SVC encoders as well as the display formats and the error concealment mechanisms adopted by the viewing devices. In this paper, we present the results of our investigation on the intricate relations among these factors. We discovered a set of constraints on the setting of quantization parameters Qp and the choices of reference layers for inter-layer dependencies that ensures good rate-distortion (R-D) trade-offs and regular bitstream extraction paths. We called these constraints, the adaptation rules for inter-layer encoding, and their encoding results, the well-adapted SVC layers and bitstreams. We further discovered that bitstream extraction performed by different viewing devices may follow predictable paths if the SVC bitstream is well-adapted and the playback process extracts a complete set of interdependent NAL units at every successive refinement step. This regularity in bitstream extraction enabled us to develop an scalable bitstream extraction algorithm based on local optimization of rate-distortion ratios. A host of experiments was perfromed using the Joint Scalable Video Model 9 (JSVM v.9) of SVC encoder/decoder. Their purpose was to study the effect of Qp settings and reference layer choices on the R-D performance of SVC bitstreams with respect to different video contents, device types, objective vs. subjective distortion measurements and spatiotemporal interpolation algorithms for error concealment. The experimental results based on standard video test sequences are presented in this paper to support our findings.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Motion-refined rewriting of H.264/AVC-coded video to SVC streams

In this paper, we discuss motion-refined rewriting of single-layer H.264/AVC streams to SVC streams with multiple quality layers. First, we elaborate on techniques we developed for efficient rewriting of residual data from H.264/AVC to SVC. We investigate if rate-distortion performance can further be improved by extending these architectures with motion refinement techniques, which exploit the ...

متن کامل

Quality Estimation for H.264/SVC Inter-layer Residual Prediction in Spatial Scalability

Scalable Video Coding (SVC) provides an efficient compression for the video bitstream equipped with various scalable configurations. H.264 scalable extension (H.264/SVC) is the most recent scalable coding standard. It involves the state-of-the-art inter-layer prediction to provide higher coding efficiency than previous standards. Moreover, the requirements for the video quality on distinct situ...

متن کامل

SVC-based scalable multiple description video coding and optimization of encoding configuration

It is well known that multiple multicast trees can avoid single point of failures in peerto-peer (P2P) streaming originating from a single source by providing path diversity. Multiple description video coding is well suited for serving video over multiple multicast trees, such that each description can be streamed over a different tree. In scalable multiple description coding (SMDC), each descr...

متن کامل

A fast algorithm of bitstream extraction using distortion prediction based on simulated annealing

Scalable video streams can be extracted to meet the bandwidth limitation of different networks and end-users. Bitstream extraction is usually performed at the network proxy or gateway during transmission, where a low computational complexity is always preferred. How to quickly and accurately select the best resolution combination for a video to meet different bandwidth requirements by each user...

متن کامل

Efficient Block Mode Determination Algorithm Using Adaptive Search Direction Information for Scalable Video Coding (SVC)

Scalable video coding (SVC) is an extension of H.264/AVC for providing three types of scalabilities: temporal, spatial, and quality. These scalabilities can be combined to make use of network bandwidth. As an extension of the H.264/AVC video standard, an inter-mode prediction technique is used for reducing temporal as well as spatial redundancies by using seven variable-sized macroblocks (MBs) ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • J. Visual Communication and Image Representation

دوره 19  شماره 

صفحات  -

تاریخ انتشار 2008